1 Introduction



A relative equilibrium of a Lagrangian system which is invariant under a Lie group G is a motion 
of the system which coincides with an orbit of a 1-parameter subgroup of G. Relative equilibria 
are of interest in particle dynamics [8, 9, 13] of course, and also in Riemannian and Finsler
geometry [HE! [15], though there they are often studied under different names. 

In this paper we consider Lagrangians which are arbitrary apart from being subject to certain 
regularity conditions, with symmetry groups which are also arbitrary except for being required 
to act freely and properly on the space. We prove a very general criterion for finding relative 
equilibria: a relative equilibrium is a critical point of the restriction of the energy to a level set of 
the momentum. We discuss the relation between this result and a different criterion for relative 
equilibria given by Lewis [9]. We also consider in some detail two special cases, namely the 
case where the configuration space is a Lie group [5, 16] and the case of a simple mechanical
system [11], and in the context of the latter we make some remarks about the so-called Saari
conjecture [8]. One of our purposes in this paper, indeed, is to provide a single framework for
a variety of results about the conditions for the existence of relative equilibria both in general 
and in particular circumstances. 

We shall use methods based on the consideration of frames adapted to the group action, and 
velocity variables associated with such frames, variables which are sometimes called
quasi-velocities. To the best of our knowledge the study of relative equilibria by such methods has
not been carried out before, at least in recent times. We have already used these methods in
studying other aspects of dynamical systems with symmetry [2, 3, 12], and some derivations
which are passed over rather quickly here are dealt with at somewhat greater length in these 
references; nevertheless the present paper is designed to be reasonably self-contained. 

The basic relevant facts about group actions are discussed in Section 2. Section 3 is devoted 
to explaining our approach to Lagrangian theory. The main result is proved in Section 4. In 
Section 5 the alternative criterion for the existence of relative equilibria due to Lewis is derived 
using our formalism. The applications are discussed in Section 6. 
2 Preliminaries



Suppose that $\psi^M : G \times M \to M$ is a free and proper left action of a connected Lie group $G$ on a
manifold $M$. (In using left actions we follow the convention of Marsden and Ratiu [10, 11]. Other
authors, including for example Kobayashi and Nomizu [6], use right actions; as a consequence
our formulae may differ in sign from those to be found elsewhere in the literature.) The manifold
$M$ is therefore a principal fibre bundle with group $G$, over a base manifold $B$ say. Let $\mathfrak{g}$ be the
Lie algebra of $G$. For any $\xi \in \mathfrak{g}$, $\tilde\xi$ will denote the corresponding fundamental vector field on $M$,
that is, the infinitesimal generator of the 1-parameter group $\psi^M_{\exp(t\xi)}$ of transformations of $M$.
Since $G$ is connected, a tensor field on $M$ is $G$-invariant if and only if its Lie derivatives by all
fundamental vector fields vanish. In particular, a vector field $X$ on $M$ is invariant if $[\tilde\xi, X] = 0$
for all $\xi \in \mathfrak{g}$; indeed, it is sufficient that $[\tilde E_a, X] = 0$, $a = 1, 2, \ldots, \dim(\mathfrak{g})$, where $\{E_a\}$ is any
basis of $\mathfrak{g}$.

We will work with a (local) basis $\{X_i, \tilde E_a\}$ of vector fields on $M$ adapted to the bundle structure,
where the $\tilde E_a$ are fundamental vector fields corresponding to a basis of $\mathfrak{g}$, and the $X_i$,
$i = 1, 2, \ldots, \dim(B)$, are $G$-invariant. To obtain such invariant vector fields we may introduce a
principal connection on $M$ and a local basis of vector fields on $B$ (a coordinate basis for example),
and take for the $X_i$ the horizontal lifts to $M$ of these vector fields, relative to the connection. We
call such a basis $\{X_i, \tilde E_a\}$ a standard basis. The pairwise brackets of the elements of a standard
basis are

\[ [X_i, X_j] = R^a_{ij}\tilde E_a, \qquad [X_i, \tilde E_a] = 0, \qquad [\tilde E_a, \tilde E_b] = -C^c_{ab}\tilde E_c : \]

the $R^a_{ij}$ are the components of the curvature of the connection, regarded as a $\mathfrak{g}$-valued tensor
field, and the $C^c_{ab}$ are the structure constants of $\mathfrak{g}$ with respect to the basis $\{E_a\}$ (the minus sign
occurs because the fundamental vector fields behave as right, not left, invariant vector fields on
$G$).

Since we will be concerned with Lagrangian functions and their corresponding Euler-Lagrange
equations we must consider also certain geometrical structures on the tangent bundle of $M$,
which will be denoted by $\tau : TM \to M$. One important idea is that of lifting vector fields from
$M$ to $TM$. There are in fact two canonical ways of carrying this out (see for example [HE] for
more details on the following material). Let $Z$ be a vector field on $M$. The complete or tangent
lift of $Z$ to $TM$, $Z^C$, is the vector field whose flow consists of the tangent maps of the flow of $Z$.
The vertical lift of $Z$, $Z^V$, is tangent to the fibres of $\tau$ and on the fibre over $m \in M$ coincides
with the constant vector field $Z_m$. Then $T\tau(Z^C) = Z$ while $T\tau(Z^V) = 0$. Complete and vertical
lifts satisfy the following bracket relations:

\[ [Y^C, Z^C] = [Y, Z]^C, \qquad [Y^C, Z^V] = [Y, Z]^V, \qquad [Y^V, Z^V] = 0. \]

From a standard basis $\{X_i, \tilde E_a\}$ on $M$ we may construct a standard basis $\{X_i^C, \tilde E_a^C, X_i^V, \tilde E_a^V\}$ on
$TM$ by taking complete and vertical lifts. We will need to use the following bracket relations
satisfied by these vector fields:

\[ [\tilde E_a^C, X_i^C] = [\tilde E_a^C, X_i^V] = 0, \qquad [\tilde E_a^C, \tilde E_b^C] = -C^c_{ab}\tilde E_c^C, \qquad [\tilde E_a^C, \tilde E_b^V] = -C^c_{ab}\tilde E_c^V. \]



We can use any basis of vector fields $\{Z_\alpha\}$ on a manifold $M$ to introduce fibre coordinates on
$TM$, simply by taking the coordinates of a point $u$ in the fibre over $m$ to be the components of
$u \in T_mM$ with respect to the basis $\{Z_\alpha|_m\}$ of $T_mM$; such fibre coordinates are sometimes called
quasi-velocities, and we will follow this practice. We can specify quasi-velocities more succinctly
as follows. Let $\{\theta^\alpha\}$ be the basis of 1-forms on $M$ dual to the basis $\{Z_\alpha\}$ of vector fields, and
for any 1-form $\theta$ on $M$ let $\hat\theta$ denote the function on $TM$ defined by $\hat\theta(m, u) = \langle u, \theta_m\rangle$. Then the
functions $\hat\theta^\alpha$ are the quasi-velocities corresponding to the $Z_\alpha$. The calculation of the derivatives
of quasi-velocities along complete and vertical lifts of basis vector fields is carried out with the
use of the following formulae:

\[ Z^C(\hat\theta) = \widehat{\mathcal{L}_Z\theta}, \qquad Z^V(\hat\theta) = \tau^*\theta(Z). \]

In particular, $Z_\alpha^V(\hat\theta^\beta) = \delta_\alpha^\beta$.

Consider now a standard basis $\{X_i, \tilde E_a\}$. We write $(v^i, v^a)$ for the corresponding quasi-velocities.
Using the formulae above, we obtain

\[ \begin{aligned}
X_i^C(v^j) &= 0, & X_i^V(v^j) &= \delta_i^j, & X_i^C(v^a) &= -R^a_{ij}v^j, & X_i^V(v^a) &= 0, \\
\tilde E_a^C(v^i) &= 0, & \tilde E_a^V(v^i) &= 0, & \tilde E_a^C(v^b) &= C^b_{ac}v^c, & \tilde E_a^V(v^b) &= \delta_a^b.
\end{aligned} \]

It will sometimes be convenient to use a slightly unconventional notation for points in $TM$: we
will denote such points in the form $(m, v^i, v^a)$, where $(v^i, v^a)$ are the quasi-velocities of a point
in $T_mM$ with respect to a specific standard basis.
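As a check on one entry of this table, the following sketch derives $X_i^C(v^a) = -R^a_{ij}v^j$ from the formula $Z^C(\hat\theta) = \widehat{\mathcal{L}_Z\theta}$. It assumes the standard identity $(\mathcal{L}_Z\theta)(W) = Z(\theta(W)) - \theta([Z,W])$, and the symbols $\vartheta^i$, $\theta^a$ for the 1-forms dual to the standard basis are introduced here only for illustration.

```latex
% Dual basis (notation assumed for this sketch):
%   \vartheta^i(X_j) = \delta^i_j,  \vartheta^i(\tilde E_a) = 0,
%   \theta^a(X_j) = 0,              \theta^a(\tilde E_b) = \delta^a_b,
% so that v^i = \hat\vartheta^i and v^a = \hat\theta^a.
\begin{align*}
(\mathcal{L}_{X_i}\theta^a)(X_j)
  &= X_i\bigl(\theta^a(X_j)\bigr) - \theta^a([X_i,X_j])
   = 0 - \theta^a(R^b_{ij}\tilde E_b) = -R^a_{ij}, \\
(\mathcal{L}_{X_i}\theta^a)(\tilde E_b)
  &= X_i\bigl(\theta^a(\tilde E_b)\bigr) - \theta^a([X_i,\tilde E_b])
   = 0 - 0 = 0 .
\end{align*}
% Hence \mathcal{L}_{X_i}\theta^a = -R^a_{ij}\vartheta^j, and therefore
\[
X_i^C(v^a) = \widehat{\mathcal{L}_{X_i}\theta^a} = -R^a_{ij}\,v^j .
\]
```

The other entries follow in the same way from the bracket relations of the standard basis.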



3 The Euler-Lagrange equations 

We next explain our approach to Lagrangian theory, beginning with the general situation where 
no symmetries are assumed. 

A Lagrangian $L$ is a function on a tangent bundle $TM$ (we deal only with the autonomous case).
Take local coordinates $(x^\alpha)$ on $M$ and the corresponding local coordinates $(x^\alpha, u^\alpha)$ on $TM$. The
Euler-Lagrange equations of $L$,

\[ \frac{d}{dt}\left(\frac{\partial L}{\partial u^\alpha}\right) - \frac{\partial L}{\partial x^\alpha} = 0, \]

are second-order ordinary differential equations for the extremals. However, the second derivatives
$\ddot x^\alpha$ are not necessarily determined by these equations. We say that $L$ is regular if

\[ \frac{\partial^2 L}{\partial u^\alpha \partial u^\beta}, \]

its Hessian with respect to the fibre coordinates, is everywhere non-singular when considered
as a symmetric matrix. When the Lagrangian is regular the Euler-Lagrange equations may be
solved explicitly for the $\ddot x^\alpha$ to give a system of differential equations of the form $\ddot x^\alpha = \Gamma^\alpha(x, \dot x)$;
in turn, these equations can be thought of as defining a vector field $\Gamma$ on $TM$, namely

\[ \Gamma = u^\alpha \frac{\partial}{\partial x^\alpha} + \Gamma^\alpha \frac{\partial}{\partial u^\alpha}. \]

This vector field, which is an example of a second-order differential equation field, is called the
Euler-Lagrange field of $L$. The Euler-Lagrange equations may be written

\[ \Gamma\left(\frac{\partial L}{\partial u^\alpha}\right) - \frac{\partial L}{\partial x^\alpha} = 0; \]

they determine $\Gamma$, assuming it to be a second-order differential equation field, when $L$ is regular.
In this paper we will assume that $L$ is regular and we will work with the Euler-Lagrange equations
in terms of the second-order differential equation field $\Gamma$. However, we need to be able to express
those equations, and the property of being a second-order differential equation field, in terms of
a basis of vector fields on $M$ which is not necessarily of coordinate type, say $\{X_\alpha\}$. A vector
field $\Gamma$ is a second-order differential equation field if it takes the form

\[ \Gamma = u^\alpha X_\alpha^C + \Gamma^\alpha X_\alpha^V \]

where the $u^\alpha$ are the quasi-velocities corresponding to the basis $\{X_\alpha\}$. Furthermore, the equations

\[ \Gamma(X_\alpha^V(L)) - X_\alpha^C(L) = 0 \]

are equivalent to the Euler-Lagrange equations.

We will also need a coordinate-independent expression for the Hessian. In fact the Hessian $g$
of $L$, evaluated at $u \in TM$, is the symmetric bilinear form $g_u$ on $T_mM$, $m = \tau(u)$, given by
$g_u(v, w) = v^V(w^V(L))$, where the vertical lifts are considered as vector fields on $T_mM$. We can
equally well regard $g_u$ as a bilinear form on the vertical subspace of $T_uTM$, by identifying $v$ and
$w$ with their vertical lifts. Since we assume that $L$ is regular we know that $g$ is non-singular.

Suppose now that $L$ has a symmetry group $G$, acting to the left on $M$ in such a way that $M$ is
a principal bundle with $G$ as its group, as we described above. By saying that $G$ is a symmetry
group of the Lagrangian we mean that $L$ is invariant under the induced action of $G$ on $TM$, so
that $\tilde\xi^C(L) = 0$ for all $\xi \in \mathfrak{g}$. A regular invariant Lagrangian determines an Euler-Lagrange field
which is also invariant.

We choose a standard basis of vector fields $\{X_i, \tilde E_a\}$ on $M$, as described above. The invariance of
the Lagrangian can be characterized by the property $\tilde E_a^C(L) = 0$. The Euler-Lagrange equations
for $L$ are

\[ \begin{aligned}
\Gamma(X_i^V(L)) - X_i^C(L) &= 0, \\
\Gamma(\tilde E_a^V(L)) - \tilde E_a^C(L) &= 0.
\end{aligned} \]

It follows immediately from invariance that $\Gamma(\tilde E_a^V(L)) = 0$, which is to say that the functions
$\tilde E_a^V(L)$, which we denote by $p_a$, are first integrals of $\Gamma$. In fact the $p_a$ can be regarded as
components of an element of $\mathfrak{g}^*$, the dual of the Lie algebra $\mathfrak{g}$, and the corresponding vector
is called the momentum. The map $TM \to \mathfrak{g}^*$ by $v \mapsto (p_a(v))$ is equivariant between the given
action of $G$ on $TM$ and the coadjoint action of $G$ on $\mathfrak{g}^*$. We have

\[ \tilde E_a^C(p_b) = \tilde E_a^C\tilde E_b^V(L) = [\tilde E_a^C, \tilde E_b^V](L) = -C^c_{ab}\tilde E_c^V(L) = -C^c_{ab}p_c, \]

which expresses the differential version of this result in our formalism.

The Euler-Lagrange field $\Gamma$ is tangent to any level set of momentum, that is, any subset of $TM$ of
the form $p_a = \mu_a = \text{constant}$, $a = 1, 2, \ldots, \dim G$, provided of course that it is a submanifold.
To describe when this is so we have recourse to the Hessian again. The components of the
Hessian $g$ with respect to our standard basis will be expressed as follows:

\[ g(\tilde E_a, \tilde E_b) = g_{ab}, \qquad g(X_i, X_j) = g_{ij}, \qquad g(X_i, \tilde E_a) = g_{ia} = g_{ai} = g(\tilde E_a, X_i) \]

(in general these will be functions on $TM$, not $M$). Then if $(g_{ab})$ is non-singular the equations
$p_a = \mu_a$ in principle determine the $v^a$ in terms of the other variables, so the level set of momentum
will be a submanifold; we accordingly make the further assumption about $L$ that $(g_{ab})$ is
non-singular everywhere.

We will be working on a level set of momentum, say $p_a = \mu_a$, which we denote by $N_\mu$. We
will next define vector fields related to $X_i^C$, $X_i^V$ and $\tilde E_a^C$ which are tangent to $N_\mu$. Since by
assumption $(g_{ab})$ is non-singular, there are uniquely defined coefficients $A_i^b$, $B_i^b$ and $C_a^b$ such that

\[ \begin{aligned}
(X_i^C + A_i^b\tilde E_b^V)(p_a) &= X_i^C(p_a) + A_i^b g_{ab} = 0, \\
(X_i^V + B_i^b\tilde E_b^V)(p_a) &= X_i^V(p_a) + B_i^b g_{ab} = 0, \\
(\tilde E_a^C + C_a^b\tilde E_b^V)(p_c) &= \tilde E_a^C(p_c) + C_a^b g_{bc} = 0.
\end{aligned} \]

Define vector fields $\bar X_i^C$, $\bar X_i^V$ and $\bar E_a^C$ by

\[ \begin{aligned}
\bar X_i^C &= X_i^C + A_i^a\tilde E_a^V, \\
\bar X_i^V &= X_i^V + B_i^a\tilde E_a^V, \\
\bar E_a^C &= \tilde E_a^C + C_a^b\tilde E_b^V;
\end{aligned} \]

they are tangent to each level set $N_\mu$. (The notation is not meant to imply that the barred
vector fields are actually complete or vertical lifts.) We need expressions for the actions of $\bar X_i^C$,
$\bar X_i^V$ and $\bar E_a^C$ on $v^i$ and $v^a$, and for their pairwise brackets. For the former we have



\[ \begin{aligned}
\bar X_i^V(v^j) &= \delta_i^j, & \bar X_i^C(v^j) &= 0, & \bar E_a^C(v^i) &= 0, \\
\bar X_i^V(v^a) &= B_i^a, & \bar X_i^C(v^a) &= -R^a_{ij}v^j + A_i^a, & \bar E_a^C(v^b) &= C^b_{ac}v^c + C_a^b.
\end{aligned} \]

To find the brackets of barred vector fields we argue as follows. The vector fields $\tilde E_a^V$ are
transverse to the level sets, and the barred vector fields span the tangent spaces to them. Thus on any level set the
bracket of any two of the barred vector fields is a linear combination of vector fields of the same
form. Consider for example $[\bar E_a^C, \bar X_i^V]$. It is easy to see from the expressions for $\bar E_a^C$ and $\bar X_i^V$ that
this bracket is at worst a linear combination of the $\tilde E_a^V$; it follows immediately that $[\bar E_a^C, \bar X_i^V] = 0$.
By similar arguments we can show that the brackets of the barred vector fields just reproduce
those of their unbarred counterparts, except that $[\bar X_i^C, \bar X_j^V] = 0$ (though we won't actually use
this fact).

We will now rewrite the Euler-Lagrange equations $\Gamma(X_i^V(L)) - X_i^C(L) = 0$, taking into account
the fact that $\Gamma$ is tangent to the level sets of momentum. For this purpose we introduce the
function $\mathcal{R}$ on $TM$ given by

\[ \mathcal{R} = L - v^a p_a. \]

Since $\mathcal{R}$ generalizes in an obvious way the classical Routhian corresponding to ignorable
coordinates [10, 13] we call it the Routhian. We have discussed the generalization of Routh's
procedure to arbitrary regular Lagrangians with non-Abelian symmetry groups elsewhere [3];
we must repeat the derivation of the expression of the remaining Euler-Lagrange equations in
terms of $\mathcal{R}$.

To obtain the desired equations we first express $X_i^C(L)$ and $X_i^V(L)$ in terms of the barred vector
fields and the Routhian, as follows:

\[ \begin{aligned}
X_i^C(L) &= \bar X_i^C(L) - A_i^a\tilde E_a^V(L) \\
&= \bar X_i^C(L - v^a p_a) + (-R^a_{ij}v^j + A_i^a)p_a + v^a\bar X_i^C(p_a) - A_i^a p_a \\
&= \bar X_i^C(\mathcal{R}) - p_a R^a_{ij}v^j; \\
X_i^V(L) &= \bar X_i^V(L) - B_i^a\tilde E_a^V(L) \\
&= \bar X_i^V(L - v^a p_a) + B_i^a p_a + v^a\bar X_i^V(p_a) - B_i^a p_a \\
&= \bar X_i^V(\mathcal{R}).
\end{aligned} \]

Thus if we denote by $\mathcal{R}^\mu$ the restriction of the Routhian to the submanifold $N_\mu$ (where it becomes
$L - v^a\mu_a$), taking account of the fact that $\Gamma$ is tangent to $N_\mu$ we have

\[ \Gamma(\bar X_i^V(\mathcal{R}^\mu)) - \bar X_i^C(\mathcal{R}^\mu) = -\mu_a R^a_{ij}v^j. \]

These are the reduced Euler-Lagrange equations, or the generalized Routh equations as they
are called in [3].

Since $\Gamma$ satisfies $\Gamma(p_a) = 0$ it may be expressed in the form

\[ \Gamma = v^i\bar X_i^C + \Gamma^i\bar X_i^V + v^a\bar E_a^C. \]

If the matrix-valued function $\bar X_i^V(\bar X_j^V(\mathcal{R}))$ is non-singular, the generalized Routh equations will
determine the coefficients $\Gamma^i$. We show now that this is always the case under the assumptions
that $L$ is regular and that $(g_{ab})$ is non-singular.

Recall that $\bar X_i^V = X_i^V + B_i^a\tilde E_a^V$ is determined by the condition that $\bar X_i^V(p_a) = 0$; it follows that
$B_i^a = -g^{ab}g_{ib}$, where $(g^{ab})$ is the matrix inverse to $(g_{ab})$. Now $\bar X_i^V(\mathcal{R}) = X_i^V(L)$, so

\[ \bar X_i^V(\bar X_j^V(\mathcal{R})) = (X_i^V - g^{ab}g_{ib}\tilde E_a^V)(X_j^V(L)) = g_{ij} - g^{ab}g_{ia}g_{jb}. \]

It is a straightforward exercise in linear algebra to show that under the stated conditions the
matrix with these components is non-singular.
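One way to carry out the linear-algebra exercise is through the Schur complement. The sketch below is not taken from the source; it assumes only the block form of the Hessian in a standard basis and the usual block determinant identity.

```latex
% Block form of the Hessian g in the standard basis \{X_i, \tilde E_a\}:
\[
(g) =
\begin{pmatrix}
g_{ij} & g_{ib} \\
g_{aj} & g_{ab}
\end{pmatrix},
\qquad
S_{ij} := g_{ij} - g_{ia}\,g^{ab}\,g_{bj} .
\]
% Since (g_{ab}) is invertible, the block determinant formula gives
\[
\det(g) = \det(g_{ab}) \,\det(S_{ij}) .
\]
% Regularity of L means \det(g) \neq 0, and \det(g_{ab}) \neq 0 by assumption,
% so \det(S_{ij}) \neq 0: the matrix g_{ij} - g^{ab} g_{ia} g_{jb} is non-singular.
```

Here $S_{ij}$ is the Schur complement of the block $(g_{ab})$; by the symmetry of $g$ it coincides with the matrix $g_{ij} - g^{ab}g_{ia}g_{jb}$ obtained above.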

4 Relative equilibria 

Consider an autonomous second-order differential equation field $\Gamma$ on the tangent bundle $TM$ of
a manifold $M$. Let $t \mapsto \gamma(t)$ be a base integral curve of $\Gamma$, that is, a curve on $M$ whose natural
lift $t \mapsto (\gamma(t), \dot\gamma(t))$ to $TM$ is an integral curve of $\Gamma$. The curve $\gamma$ is uniquely determined by its
initial conditions $(\gamma(0), \dot\gamma(0))$ and the fact that it is a base integral curve.

Now suppose that a Lie group $G$ acts to the left on $M$ in such a way that $M$ is a principal
$G$-bundle, $\pi : M \to B$; and suppose that $\Gamma$ is invariant under the induced action of $G$ on $TM$.
Then $G$ maps base integral curves of $\Gamma$ to base integral curves; and for $g \in G$, $t \mapsto \psi^M_g(\gamma(t))$ is
the base integral curve with initial conditions $\psi^{TM}_g(\gamma(0), \dot\gamma(0))$.

A base integral curve $\gamma$ is a relative equilibrium of $\Gamma$ if it coincides with an integral curve of a
fundamental vector field of the action of $G$ on $M$, that is, if $\gamma(t) = \psi^M_{\exp(t\xi)}(m)$ for some $m \in M$,
$\xi \in \mathfrak{g}$; of course $m = \gamma(0)$, and $\dot\gamma(0) = \tilde\xi_m$. A relative equilibrium is a curve in a fibre of
$\pi : M \to B$, so that $\pi(\gamma(t))$ is a fixed point of $B$; but not all curves that project onto fixed
points of $B$ are relative equilibria. Evidently if $\gamma$ is a relative equilibrium, so is $\psi^M_g \circ \gamma$ for any
$g \in G$.

The base integral curve $\gamma$ is a relative equilibrium if and only if its natural lift coincides with an
integral curve of a fundamental vector field of the induced action of $G$ on $TM$. That is to say,
if an integral curve of the vector field $\Gamma$ coincides with an integral curve of $\tilde\xi^C$ for some $\xi \in \mathfrak{g}$
then the corresponding base integral curve is a relative equilibrium, and conversely. But we are
now dealing directly with an invariant vector field, namely $\Gamma$; by invariance, the integral curve
of $\Gamma$ through $v \in TM$ will coincide with that of $\tilde\xi^C$ if and only if $\Gamma_v = \tilde\xi^C_v$. Thus finding relative
equilibria is a matter of locating points $v \in TM$ with the property that $\Gamma_v = \tilde\xi^C_v$ for some $\xi \in \mathfrak{g}$;
we call such points relative equilibrium points. We will shortly address the problem of finding
relative equilibrium points for the Euler-Lagrange field of an invariant Lagrangian.

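Recall that in the absence of symmetry, the equilibrium points of a regular Lagrangian (the
zeros of its Euler-Lagrange field) are just the critical points of the energy. It may be worth
seeing why, for comparison with what follows. Let $(x^\alpha, u^\alpha)$ denote coordinates on $TM$. If $\mathcal{E}$ is
the energy of a Lagrangian $L$, so that

\[ \mathcal{E} = u^\alpha\frac{\partial L}{\partial u^\alpha} - L, \]

then

\[ \begin{aligned}
\frac{\partial\mathcal{E}}{\partial x^\alpha} &= u^\beta\frac{\partial^2 L}{\partial x^\alpha\partial u^\beta} - \frac{\partial L}{\partial x^\alpha}
= -\Gamma^\beta\frac{\partial^2 L}{\partial u^\alpha\partial u^\beta} + u^\beta\left(\frac{\partial^2 L}{\partial x^\alpha\partial u^\beta} - \frac{\partial^2 L}{\partial x^\beta\partial u^\alpha}\right), \\
\frac{\partial\mathcal{E}}{\partial u^\alpha} &= u^\beta\frac{\partial^2 L}{\partial u^\alpha\partial u^\beta},
\end{aligned} \]

and the critical points of $\mathcal{E}$ are precisely the points where $u^\alpha = 0$ and $\Gamma^\alpha = 0$.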
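A one-dimensional illustration may help fix ideas. The Lagrangian below (a particle of mass $m$ in a potential $V$) is chosen here purely as an example and does not appear in the source.

```latex
% Illustrative example: L(x,u) = \tfrac12 m u^2 - V(x) on T\mathbb{R}.
\begin{align*}
\mathcal{E} &= u\,\frac{\partial L}{\partial u} - L = \tfrac12 m u^2 + V(x),
&
\Gamma &= u\,\frac{\partial}{\partial x} - \frac{V'(x)}{m}\,\frac{\partial}{\partial u},
\\
\frac{\partial \mathcal{E}}{\partial u} &= m u
   = u\,\frac{\partial^2 L}{\partial u^2},
&
\frac{\partial \mathcal{E}}{\partial x} &= V'(x)
   = -\Gamma^{x}\,\frac{\partial^2 L}{\partial u^2},
\qquad \Gamma^{x} = -\frac{V'(x)}{m}.
\end{align*}
% Critical points of \mathcal{E}: u = 0 and V'(x) = 0,
% i.e. exactly the zeros of \Gamma (the equilibria).
```

The bracketed term in the general formula for $\partial\mathcal{E}/\partial x^\alpha$ vanishes here because $\partial^2 L/\partial x\,\partial u = 0$.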

We will use these remarks as a guide to the formulation of a similar result about relative
equilibrium points in the Lagrangian formalism. The energy $\mathcal{E}$ of the Lagrangian $L$ is given by
$\mathcal{E} = \Delta(L) - L$, where $\Delta$ is the Liouville field. We note first that since $[\Delta, Z^C] = 0$ for any vector
field $Z$ on $M$, when $L$ is invariant $\mathcal{E}$ is also invariant.

We want an expression for the energy $\mathcal{E}$ of a Lagrangian in terms of a standard basis, for which
we need to know how to write $\Delta$ with respect to such a basis: the obvious guess, namely

\[ \Delta = v^i X_i^V + v^a\tilde E_a^V, \]

is in fact correct. Thus the energy of $L$ is

\[ \mathcal{E} = \Delta(L) - L = v^i X_i^V(L) - (L - v^a p_a) = v^i X_i^V(L) - \mathcal{R}. \]

We showed above that $X_i^V(L) = \bar X_i^V(\mathcal{R})$, so we can write this as

\[ \mathcal{E} = v^i\bar X_i^V(\mathcal{R}) - \mathcal{R}. \]



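Next we derive expressions for the derivatives of $\mathcal{E}$ along the barred vector fields. In the first
place,

\[ \bar X_i^V(\mathcal{E}) = v^j\bar X_i^V(\bar X_j^V(\mathcal{R})). \]

Secondly we have $\bar E_a^C(L) = (\tilde E_a^C + C_a^b\tilde E_b^V)(L) = C_a^b p_b$, so that
$\bar E_a^C(\mathcal{R}) = C_a^b p_b - p_b(C^b_{ac}v^c + C_a^b) = -C^b_{ac}p_b v^c$, whence

\[ \bar E_a^C(\mathcal{E}) = \bar E_a^C(v^i\bar X_i^V(\mathcal{R}) - \mathcal{R}) = v^i\bar E_a^C(\bar X_i^V(\mathcal{R})) - \bar E_a^C(\mathcal{R}) = C^b_{ac}p_b v^c + S_{ai}v^i, \]

where $S_{ai}$ stands for an expression whose details will not concern us. Finally,

\[ \bar X_i^C(\mathcal{E}) = \bar X_i^C(v^j\bar X_j^V(\mathcal{R}) - \mathcal{R}) = v^j\bar X_i^C(\bar X_j^V(\mathcal{R})) - \bar X_i^C(\mathcal{R}). \]

From the generalized Routh equations $\Gamma(\bar X_i^V(\mathcal{R})) - \bar X_i^C(\mathcal{R}) = -\mu_a R^a_{ij}v^j$, with $\Gamma$ expressed in the
form $\Gamma = v^i\bar X_i^C + \Gamma^i\bar X_i^V + v^a\bar E_a^C$, we obtain

\[ \bar X_i^C(\mathcal{R}) = \Gamma^j\bar X_j^V(\bar X_i^V(\mathcal{R})) + v^a\bar E_a^C(\bar X_i^V(\mathcal{R})) + T_{ij}v^j \]

where the exact form of $T_{ij}$ will again be of no concern. Now

\[ \bar E_a^C(\bar X_i^V(\mathcal{R})) = \bar X_i^V(\bar E_a^C(\mathcal{R})) = -\bar X_i^V(C^b_{ac}p_b v^c) = -C^b_{ac}p_b\bar X_i^V(v^c). \]

Thus

\[ \bar X_i^C(\mathcal{E}) = -\Gamma^j\bar X_i^V(\bar X_j^V(\mathcal{R})) + U_{ij}v^j + V_i^c v^a C^b_{ac}p_b, \]

where $U_{ij}$ and $V_i^c$ will likewise be of no particular immediate interest (though in fact $V_i^c = -B_i^c$).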

We next consider the conditions for a relative equilibrium. The integral curve of $\Gamma$ through a
point $(m, v^i, v^a)$ of $TM$ will coincide with the integral curve of some $\tilde\xi^C$, $\xi \in \mathfrak{g}$, if and only if
$\Gamma(m, v^i, v^a) = \tilde\xi^C(m, v^i, v^a)$, that is, if and only if at that point

\[ v^i\bar X_i^C + \Gamma^i\bar X_i^V + v^a\bar E_a^C = \xi^a\tilde E_a^C. \]

Thus the integral curve of $\Gamma$ through a point $(m, v^i, v^a)$ of $TM$ will coincide with the integral
curve of some $\tilde\xi^C$ if and only if

\[ v^i = 0, \qquad v^a = \xi^a, \qquad \Gamma^i(m, 0, \xi^a) = 0; \]

and moreover we must have $\xi^a\bar E_a^C = \xi^a\tilde E_a^C$, which just says that the integral curve of $\tilde\xi^C$ must lie
in the level set containing the point $(m, v^i, v^a)$, as does the integral curve of $\Gamma$. Let us assume
that we are on the level set $p_a = \mu_a$; then this last condition becomes

\[ \xi^a C^c_{ab}\mu_c = 0. \]

We can now prove that the relative equilibrium points lying in any level set $N_\mu$ are just the
critical points of the restriction $\mathcal{E}^\mu$ of $\mathcal{E}$ to $N_\mu$, assuming as before that $L$ is regular and $(g_{ab})$
is non-singular.

Suppose first there is a relative equilibrium point in $N_\mu$: it is a point $(m, v^i, v^a)$ such that $v^i = 0$,
$v^a C^c_{ab}\mu_c = 0$ and $\Gamma^i(m, 0, v^a) = 0$. From the formulae for the derivatives of $\mathcal{E}$ obtained above,
we have $\bar X_i^V(\mathcal{E}^\mu) = \bar E_a^C(\mathcal{E}^\mu) = \bar X_i^C(\mathcal{E}^\mu) = 0$ at $(m, 0, v^a)$, and since these vector fields span the
tangent distribution to the level set, the point is a critical point of $\mathcal{E}^\mu$.

Conversely, suppose that a point $(m, v^i, v^a)$, lying in $N_\mu$, is a critical point of $\mathcal{E}^\mu$, so that
$\bar X_i^V(\mathcal{E}^\mu) = \bar E_a^C(\mathcal{E}^\mu) = \bar X_i^C(\mathcal{E}^\mu) = 0$ there. Since by assumption the symmetric-matrix-valued
function $\bar X_i^V(\bar X_j^V(\mathcal{R}))$ is non-singular, we find from the condition $\bar X_i^V(\mathcal{E}^\mu) = 0$ that $v^i = 0$; from
the condition $\bar E_a^C(\mathcal{E}^\mu) = 0$ we obtain $v^a C^c_{ab}\mu_c = 0$; and from the condition $\bar X_i^C(\mathcal{E}^\mu) = 0$ we deduce
that $\Gamma^i(m, 0, v^a) = 0$. The integral curve of $\Gamma$ through the point therefore coincides with that of
$\tilde\xi^C$ where $\xi^a = v^a$.

As we have mentioned, the condition $\xi^a C^c_{ab}\mu_c = 0$ states that the fundamental vector field $\tilde\xi^C$
is tangent to the level set $N_\mu$. There is another way of interpreting this condition. We pointed
out earlier that the map $v \mapsto (p_a(v))$ is equivariant between the given action of $G$ on $TM$ and
the coadjoint action of $G$ on $\mathfrak{g}^*$. For any $\mu \in \mathfrak{g}^*$ we denote by $G_\mu$ the isotropy group of $\mu$ under
the coadjoint action, and by $\mathfrak{g}_\mu$ its Lie algebra. By equivariance, $\xi \in \mathfrak{g}_\mu$ if and only if $\tilde\xi^C$ is tangent
to $N_\mu$. Thus $\xi^a C^c_{ab}\mu_c = 0$ is also the necessary and sufficient condition that $\xi \in \mathfrak{g}_\mu$.
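To make the isotropy condition concrete, consider the rotation algebra. The sketch below assumes $\mathfrak{g} = \mathfrak{so}(3)$ with a basis in which $C^c_{ab} = \epsilon_{abc}$, and identifies both $\mathfrak{g}$ and $\mathfrak{g}^*$ with $\mathbb{R}^3$; these choices are made here for illustration and are not taken from the source.

```latex
% Assumption: \mathfrak{g} = \mathfrak{so}(3), basis with [E_a,E_b] = \epsilon_{abc} E_c,
% so C^c_{ab} = \epsilon_{abc}; \xi and \mu are treated as vectors in \mathbb{R}^3.
\[
\xi^a C^c_{ab}\,\mu_c
  = \epsilon_{abc}\,\xi^a \mu_c
  = \epsilon_{bca}\,\mu_c\,\xi^a
  = (\mu \times \xi)_b .
\]
% Hence the condition \xi^a C^c_{ab}\mu_c = 0 reads \mu \times \xi = 0:
%   for \mu \neq 0,  \mathfrak{g}_\mu = \{\xi : \xi \parallel \mu\}  (a line),
%   for \mu = 0,     \mathfrak{g}_\mu = \mathfrak{g}.
```

In this case a relative equilibrium with non-zero momentum must therefore have its generator $\xi$ parallel to $\mu$.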

5 Lewis's criterion for a relative equilibrium 

We next discuss the somewhat different criterion for the existence of a relative equilibrium given
by Lewis in [9]. Lewis defines the locked Lagrangian $L_\xi$, for any $\xi \in \mathfrak{g}$, by

\[ L_\xi(m) = L(\tilde\xi_m); \]

thus $L_\xi$ is a function on $M$. She shows that a point $(m, 0, \xi^a)$ of $TM$ is a relative equilibrium
point, for a regular Lagrangian, if and only if $m$ is a critical point of $L_\xi$. We now establish a
similar result by our methods.

The first task is to relate the derivatives of $L_\xi$ to those of $L$. For this purpose it is helpful to
observe that the specification of $L_\xi$ can be regarded as a particular case of a general construction.
Let $F$ be any function on the tangent bundle $TM$ of some manifold $M$, and $X$ any vector field
on $M$. Then $X$ is, or defines, a section of $TM \to M$, which we will denote by $\sigma_X$ for clarity;
and we can use such a section to obtain from $F$ a function $F_X$ on $M$ by pull-back: $F_X = \sigma_X^*F$.
The locked Lagrangian is an example of this construction, with $F = L$, $X = \tilde\xi$.

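We require a formula for $Y(F_X)$, the derivative of $F_X$ along any other vector field $Y$ on $M$. Now
there is a unique vector field $T\sigma_X(Y)$ on the image of the section $\sigma_X$ which is tangent to it and
which projects onto $Y$. In fact for any $v \in T_mM$, say $v^\alpha\,\partial/\partial x^\alpha$, the vector

\[ v^\alpha\frac{\partial}{\partial x^\alpha} + v^\beta\frac{\partial X^\alpha}{\partial x^\beta}\frac{\partial}{\partial u^\alpha} \]

is the unique vector which projects onto $v$ and is tangent to the section. Thus

\[ T\sigma_X(Y) = Y^\alpha\frac{\partial}{\partial x^\alpha} + Y^\beta\frac{\partial X^\alpha}{\partial x^\beta}\frac{\partial}{\partial u^\alpha}. \]

Notice that we can express the right-hand side as

\[ Y^\alpha\frac{\partial}{\partial x^\alpha} + Y^\beta\frac{\partial X^\alpha}{\partial x^\beta}\frac{\partial}{\partial u^\alpha}
= Y^\alpha\frac{\partial}{\partial x^\alpha} + X^\beta\frac{\partial Y^\alpha}{\partial x^\beta}\frac{\partial}{\partial u^\alpha}
- \left(X^\beta\frac{\partial Y^\alpha}{\partial x^\beta} - Y^\beta\frac{\partial X^\alpha}{\partial x^\beta}\right)\frac{\partial}{\partial u^\alpha}, \]

and this is just the restriction to the image of $\sigma_X$ of the vector field $Y^C - [X, Y]^V$, a vector field
which is defined globally on $TM$. Thus

\[ Y(F_X) = Y(\sigma_X^*F) = \sigma_X^*\bigl(T\sigma_X Y(F)\bigr) = \sigma_X^*\bigl((Y^C - [X, Y]^V)(F)\bigr). \]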

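We now use this result to obtain expressions for the derivatives of $L_\xi$ along the local basis vector
fields $\tilde E_a$, $X_i$ on $M$. We have $\tilde E_a^C(L) = 0$, while $[\tilde\xi, \tilde E_a] = \xi^b C^c_{ab}\tilde E_c$, whence

\[ \tilde E_a(L_\xi) = -\xi^b C^c_{ab}\,\sigma_{\tilde\xi}^*(p_c). \]

On the other hand

\[ X_i(L_\xi) = \sigma_{\tilde\xi}^*(X_i^C(L)), \]

because $X_i$ is invariant under the $G$-action. But from the Euler-Lagrange equations $X_i^C(L) =
\Gamma(X_i^V(L)) = \Gamma(\bar X_i^V(\mathcal{R}))$. So finally, at any $m \in M$,

\[ \tilde E_a|_m(L_\xi) = -\xi^b C^c_{ab}\mu_c, \qquad X_i|_m(L_\xi) = \Gamma(\bar X_i^V(\mathcal{R}))(m, 0, \xi^a), \]

where we have set $p_a(m, 0, \xi^b) = \mu_a$.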

Now suppose that $(m, 0, \xi^a)$ is a relative equilibrium point on the level set $N_\mu$. Then as we saw
earlier, $\xi^a C^c_{ab}\mu_c = 0$, so $\tilde E_a|_m(L_\xi) = 0$. Furthermore, $\bar E_a^C(\bar X_i^V(\mathcal{R})) = -C^b_{ac}p_b B_i^c$ as we showed
before, and $\Gamma = \xi^a\bar E_a^C$ by assumption, so

\[ X_i|_m(L_\xi) = \Gamma(\bar X_i^V(\mathcal{R}))(m, 0, \xi^a) = \xi^a\bar E_a^C(\bar X_i^V(\mathcal{R}))(m, 0, \xi^a) = -\xi^a C^b_{ac}\mu_b B_i^c(m, 0, \xi^a) = 0. \]

Thus $m$ is a critical point of $L_\xi$.

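Conversely, suppose that $m$ is a critical point of $L_\xi$. Then $\xi^b C^c_{ab}\mu_c = 0$, so $\tilde\xi^C$ is tangent to the
level set on which $(m, 0, \xi^a)$ lies. Furthermore, we have $\Gamma(\bar X_i^V(\mathcal{R}))(m, 0, \xi^a) = 0$. Recall that
$\Gamma = v^i\bar X_i^C + \Gamma^i\bar X_i^V + v^a\bar E_a^C$; it follows that

\[ \Gamma(\bar X_i^V(\mathcal{R}))(m, 0, \xi^a) = \Gamma^j(m, 0, \xi^a)\,\bar X_j^V(\bar X_i^V(\mathcal{R}))(m, 0, \xi^a) + \xi^a\bar E_a^C(\bar X_i^V(\mathcal{R}))(m, 0, \xi^a). \]

But $\xi^a\bar E_a^C(\bar X_i^V(\mathcal{R}))(m, 0, \xi^a) = -\xi^a C^b_{ac}\mu_b B_i^c(m, 0, \xi^a) = 0$, so

\[ \Gamma^j(m, 0, \xi^a)\,\bar X_j^V(\bar X_i^V(\mathcal{R}))(m, 0, \xi^a) = 0. \]

Since by assumption $\bar X_i^V\bar X_j^V(\mathcal{R})$ is non-singular, we have $\Gamma^i(m, 0, \xi^a) = 0$, and $(m, 0, \xi^a)$ is a
relative equilibrium point.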

If one is looking for relative equilibria with a given value of the momentum $\mu$ it is appropriate
to use the first method (searching for critical points of the restriction of the energy function to
the level set $N_\mu$); if one is looking for relative equilibria with a particular value of $\xi \in \mathfrak{g}$ then
the method described above is more suitable.
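The two criteria can be compared on a simple example. The sketch below assumes a planar particle of mass $m$ in a central potential $V(r)$ on the punctured plane, with $G = SO(2)$ acting by rotations; this system is chosen here for illustration and is not discussed in the source. The group is Abelian and the basis $\{\partial/\partial r, \partial/\partial\theta\}$ is of coordinate type, so the curvature and structure-constant terms vanish.

```latex
% Illustrative system: L = \tfrac12 m(\dot r^2 + r^2\dot\theta^2) - V(r),  G = SO(2).
% Momentum and energy:
\[
p = \frac{\partial L}{\partial \dot\theta} = m r^2 \dot\theta ,
\qquad
\mathcal{E} = \tfrac12 m \dot r^2 + \tfrac12 m r^2\dot\theta^2 + V(r).
\]
% (1) Energy restricted to the level set N_\mu (where \dot\theta = \mu/(m r^2)):
\[
\mathcal{E}^\mu(r,\dot r) = \tfrac12 m \dot r^2 + \frac{\mu^2}{2 m r^2} + V(r),
\qquad
d\mathcal{E}^\mu = 0 \iff \dot r = 0,\quad V'(r) = \frac{\mu^2}{m r^3}.
\]
% (2) Locked Lagrangian for fixed \xi (set \dot r = 0, \dot\theta = \xi):
\[
L_\xi(r) = \tfrac12 m r^2 \xi^2 - V(r),
\qquad
dL_\xi = 0 \iff V'(r) = m r\,\xi^2 .
\]
% The two conditions agree under \mu = m r^2 \xi:
%   \mu^2/(m r^3) = m^2 r^4 \xi^2/(m r^3) = m r \xi^2 .
```

Both criteria pick out the circular orbits: the first fixes the angular momentum $\mu$ and solves for the radius, the second fixes the angular velocity $\xi$ and solves for the radius.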

6 Some applications 

6.1 Systems on Lie groups 

We now specialize to the case of an invariant Lagrangian system on a Lie group $G$. For such a
system there are no conditions for relative equilibria arising from the $X_i$, so the only condition
for a point $\tilde\xi_g \in TG$ to be a relative equilibrium point is that $\xi^b C^c_{ab}\mu_c = 0$ where $\mu$ is the value
of the momentum at $\tilde\xi_g$. Thus the necessary and sufficient condition for a relative equilibrium
takes either of the following equivalent simple forms: $\tilde\xi_g$ is a relative equilibrium point if and
only if the vector field $\tilde\xi^C$ is tangent to the level set of momentum in which the point $\tilde\xi_g$ lies, or
equivalently if and only if $\xi \in \mathfrak{g}_\mu$, the algebra of the isotropy subgroup of the momentum.

In the present case the fact that the Lagrangian is invariant means that the dynamical system
on $TG$ is determined by its reduction to $T_eG \simeq \mathfrak{g}$. That is to say, the Euler-Lagrange equations
can be reduced to an equivalent set of equations on $\mathfrak{g}$, the so-called Euler-Poincaré equations
[10], which can be written

\[ \frac{d}{dt}\left(\frac{\partial l}{\partial\xi^a}\right) = C^c_{ab}\,\xi^b\frac{\partial l}{\partial\xi^c}; \]

here $l$ is the restriction of $L$ to $T_eG$, thought of as a function on $\mathfrak{g}$, and the $\xi^a$ here are the
Cartesian coordinates on $\mathfrak{g}$ determined by the basis $\{E_a\}$. These equations, which are first-order
differential equations in the variables $\xi^a$, determine in the regular case a vector field $\gamma$ on
$\mathfrak{g}$ from which the Euler-Lagrange field $\Gamma$ on $TG$ can be reconstructed. In fact a curve $t \mapsto g(t)$
in $G$ is a base integral curve of $\Gamma$ if and only if the curve $t \mapsto T\psi_{g(t)^{-1}}\dot g(t)$ in $\mathfrak{g}$ is an integral
curve of $\gamma$.

In this picture the relative equilibria are simply constant solutions of the Euler-Poincaré
equations, and these are points $\xi$ of $\mathfrak{g}$ at which

\[ C^c_{ab}\,\xi^b\frac{\partial l}{\partial\xi^c} = 0. \]

A solution of these equations determines a relative equilibrium starting at $e$, or in other words a
base integral curve of $\Gamma$ which coincides with a 1-parameter subgroup of $G$; but since translates
of relative equilibria are relative equilibria, this is enough to give all relative equilibria. Now

\[ p_c|_{T_eG} = \tilde E_c^V(L)|_{T_eG} = \frac{\partial l}{\partial\xi^c}, \]

so the two approaches give the same results so far as relative equilibria through the identity are
concerned.
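The free rigid body gives a familiar instance of this condition. The sketch below assumes $G = SO(3)$, a basis of $\mathfrak{so}(3)$ with $C^c_{ab} = \epsilon_{abc}$, and a reduced Lagrangian that is diagonal in that basis with principal moments $I_1, I_2, I_3$; none of these choices is taken from the source.

```latex
% Assumptions: C^c_{ab} = \epsilon_{abc},  l(\xi) = \tfrac12 \sum_a I_a (\xi^a)^2,  I_a > 0.
\[
\frac{\partial l}{\partial \xi^c} = I_c\,\xi^c \ \ (\text{no sum}),
\qquad
C^c_{ab}\,\xi^b\,\frac{\partial l}{\partial \xi^c}
   = \sum_{b,c}\epsilon_{abc}\,\xi^b\, I_c\,\xi^c
   = \bigl(\xi \times I\xi\bigr)_a .
\]
% Componentwise, the constant-solution condition \xi \times I\xi = 0 reads
\[
(I_3 - I_2)\,\xi^2\xi^3 = 0,\qquad
(I_1 - I_3)\,\xi^3\xi^1 = 0,\qquad
(I_2 - I_1)\,\xi^1\xi^2 = 0 .
\]
% If I_1, I_2, I_3 are distinct, at most one component of \xi is non-zero:
% the relative equilibria are the steady rotations about a principal axis.
```

Since the condition is homogeneous in $C^c_{ab}$, the conclusion does not depend on the sign convention adopted for the structure constants.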

We discuss next the relations between the general criteria for finding relative equilibrium points
obtained earlier and the observations above. In order to do so we must first consider the
identification of $TG$ with $G \times \mathfrak{g}$. Since we are working with left actions the fundamental vector fields
are right, not left, invariant, so the use of quasi-coordinates relative to a basis of fundamental
vector fields amounts to identifying $T_gG$ with $T_eG$ by right rather than left translation. On the
other hand, when we say for example that $L$ is invariant we mean that it is invariant under
left translations. Under left translation, $\tilde\xi_g$ is identified with $\mathrm{ad}_{g^{-1}}\xi$. For any left-invariant
function $F$ we have $F(\tilde\xi_g) = F(\mathrm{ad}_{g^{-1}}\xi|_e)$. So if we denote by $f$ the function on $\mathfrak{g}$ obtained by
restricting $F$ to $T_eG$ (and identifying $T_eG$ with $\mathfrak{g}$), then $F(\tilde\xi_g) = f(\mathrm{ad}_{g^{-1}}\xi)$.

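The energy $\mathcal{E}$ in this case is just

\[ \mathcal{E}(\tilde\xi_g) = \xi^a p_a(\tilde\xi_g) - L(\tilde\xi_g) \]

(so $\mathcal{E}$ happens to coincide with $-\mathcal{R}$). Now $\mathcal{E}$ is left-invariant, and $\varepsilon$, its restriction to $\mathfrak{g}$, is just

\[ \varepsilon(\xi) = \xi^a\frac{\partial l}{\partial\xi^a} - l(\xi). \]

Notice that

\[ \frac{\partial\varepsilon}{\partial\xi^a} = \xi^b\frac{\partial^2 l}{\partial\xi^a\partial\xi^b} = \bar g_{ab}\,\xi^b, \]

where $\bar g_{ab}$ is the restriction of $g_{ab}$ to $T_eG \simeq \mathfrak{g}$; by assumption, the matrix $(\bar g_{ab})$ is non-singular
everywhere on $\mathfrak{g}$.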

The relative equilibrium points are the critical points of $\mathcal{E}^\mu$, the restriction of $\mathcal{E}$ to the level
set of momentum $N_\mu$. To express this result in terms of $\varepsilon$ we must determine those points
$(g, \xi) \in G \times \mathfrak{g} \simeq TG$ which lie in $N_\mu$. Now it follows from the regularity assumptions that $N_\mu$
is (the image of) a section of $TG \to G$, so that for each $g \in G$ there is a unique $\xi \in \mathfrak{g}$ such that
$(g, \xi) \in N_\mu$. It follows from equivariance that $g$ and $\xi$ must satisfy $\mathrm{ad}^*_{g^{-1}}\,p(\xi_e) = \mu$, or

\[ \frac{\partial l}{\partial\xi^a}(\xi) = (\mathrm{ad}^*_g\,\mu)_a. \]

This defines a map $G \to \mathfrak{g}$, which is constant on left cosets of $G_\mu$, the isotropy group of $\mu$
under the coadjoint action. Let $\mathfrak{g}(\mu) \subset \mathfrak{g}$ be the image of $G$ under this map. Then the relative
equilibrium points in $T_eG$ with momentum $\mu$ are the critical points of $\varepsilon$ restricted to $\mathfrak{g}(\mu)$.

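Now consider any curve in $N_\mu$, given in the form $t \mapsto (g(t), \xi(t))$, such that $g(0) = e$; we set
$\xi(0) = \xi_0$ and note that

\[ \frac{\partial l}{\partial\xi^a}(\xi_0) = \mu_a. \]

By differentiating the condition

\[ \frac{\partial l}{\partial\xi^a}(\xi(t)) = (\mathrm{ad}^*_{g(t)}\,\mu)_a \]

with respect to $t$ and setting $t = 0$ we obtain

\[ \bar g_{ac}(\xi_0)\,\dot\xi^a(0) = \eta^b C^d_{bc}\mu_d, \]

where $\eta$ is the tangent vector to $t \mapsto g(t)$ at $t = 0$, considered as a point of $\mathfrak{g}$. We may choose $\eta$
arbitrarily, and determine $\dot\xi(0)$ from this equation. The tangent vectors to $\mathfrak{g}(\mu)$ at $\xi_0$ are those
of the form

\[ \bar g^{ac}(\xi_0)\,\eta^b C^d_{bc}\mu_d\frac{\partial}{\partial\xi^a}. \]

It follows that $\xi_0$ will be a critical point of $\varepsilon|_{\mathfrak{g}(\mu)}$ if and only if

\[ \bar g^{ac}(\xi_0)\,\eta^b C^d_{bc}\mu_d\frac{\partial\varepsilon}{\partial\xi^a}(\xi_0) = 0 \]

for all $\eta$. This gives back the same condition as before.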

This approach is similar in spirit to that discussed by Arnold [1], and indeed generalizes that
approach insofar as the finite-dimensional case is concerned since Arnold deals only with kinetic 
energy Lagrangians defined by Riemannian metrics. 

The locked Lagrangian for a system on a group $G$ is given by $L_\xi(g) = L(\tilde\xi_g)$, for fixed $\xi$. It
follows from the invariance assumption that $L_\xi(g) = l(\mathrm{ad}_{g^{-1}}\xi)$. We can think of the right-hand
side as the restriction of $l$ to the orbit of $\xi$ under the adjoint action of $G$ on $\mathfrak{g}$, which we denote
by $\mathcal{G}(\xi)$; that is, $L_\xi = l|_{\mathcal{G}(\xi)}$. According to Lewis's criterion, $(g, \xi)$ is a relative equilibrium point
if and only if $g$ is a critical point of $L_\xi$. Let $G_\xi$ be the isotropy group of $\xi$ under the adjoint
action; then $\mathcal{G}(\xi) \cong G/G_\xi$. Clearly $L_\xi$ is constant on the fibres of the projection $p : G \to G/G_\xi$,
from which it follows that $g$ is a critical point of $L_\xi$ if and only if $p(g)$ is a critical point of $l|_{\mathcal{G}(\xi)}$.
Thus in this case Lewis's criterion can be restated in the following form: $(g, \xi)$ is a relative
equilibrium point if and only if $p(g)$ is a critical point of $l|_{\mathcal{G}(\xi)}$. Lewis's criterion again reduces
to the condition $\xi^b C^c_{ab}\,p_c = 0$, or more succinctly $\langle[\eta, \xi], \mu\rangle = 0$ for all $\eta \in \mathfrak{g}$. The role of $G_\xi$ is
revealed here by the observation that this condition is automatically satisfied if $[\eta, \xi] = 0$, that
is, if $\eta$ lies in the centralizer of $\xi$: but this is exactly the algebra of $G_\xi$.
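
As an illustration of the condition $\xi^b C^c_{ab}\,p_c = 0$, the following sketch evaluates it for $G = SO(3)$ with a quadratic reduced Lagrangian $l(\xi) = \tfrac12 I_{ab}\xi^a\xi^b$. The choice of group, the basis in which $C^c_{ab}$ is the Levi-Civita symbol, the numerical inertia matrix and all function names are assumptions made for the example and are not taken from the text.

```python
# Sketch: evaluate xi^b C^c_{ab} p_c on so(3), assuming a basis with
# [E_a, E_b] = eps_{abc} E_c and l(xi) = (1/2) I_ab xi^a xi^b.

def levi_civita(a, b, c):
    """Structure constants C^c_{ab} of so(3) in the assumed basis (indices 0..2)."""
    return (a - b) * (b - c) * (c - a) / 2


def momentum(inertia, xi):
    """p_a = dl/dxi^a = I_ab xi^b for the quadratic reduced Lagrangian."""
    return [sum(inertia[a][b] * xi[b] for b in range(3)) for a in range(3)]


def residual(inertia, xi):
    """Components r_a = xi^b C^c_{ab} p_c; all vanish at a relative equilibrium."""
    p = momentum(inertia, xi)
    return [
        sum(levi_civita(a, b, c) * xi[b] * p[c]
            for b in range(3) for c in range(3))
        for a in range(3)
    ]


def is_relative_equilibrium(inertia, xi, tol=1e-12):
    """True when every component of the residual is below the tolerance."""
    return all(abs(r) < tol for r in residual(inertia, xi))


# Hypothetical inertia matrix with three distinct principal moments.
INERTIA = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 3.0]]
```

In this basis the residual is the cross product of $\xi$ with $I\xi$, so with the values above a vector along a principal axis, such as $(0, 0, 1)$, satisfies the condition, whereas $(1, 1, 0)$ does not.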

There is yet another way of arriving at the condition $\xi^b C^c_{ab}\,p_c = 0$. The fundamental vector
fields $\tilde E_a$ are the right translates of the $E_a$, considered as elements of $T_eG$; they are not of course
left-invariant. We denote by $\hat E_a$ the left translates of the $E_a$, which are left-invariant. The
relation between these two sets of vector fields on $G$ can be written $\tilde E_a = A_a^b \hat E_b$; the coefficients
are the matrix components of the adjoint map, and the condition of invariance gives

$$\tilde E_a(A^c_b) - C^d_{ab} A^c_d = 0,$$

where of course $A_a^b = \delta_a^b$ at $e$. Now for any vector field $X$ and function $f$ on a manifold $M$,

$$(fX)^{\mathrm c} = f X^{\mathrm c} + \dot f X^{\mathrm v},$$

where $\dot f$ is the so-called total derivative of $f$, a function on $TM$ given by

$$\dot f = u^\alpha \frac{\partial f}{\partial x^\alpha} = v^a X_a(f)$$
for a vector field basis $\{X_a\}$ with associated quasi-velocities $v^a$. Thus

$$\tilde E_a^{\mathrm c} = A_a^b \hat E_b^{\mathrm c} + \xi^c \tilde E_c(A_a^b)\, \hat E_b^{\mathrm v} = A_a^b \hat E_b^{\mathrm c} + \xi^c C^d_{ca} A^b_d\, \hat E_b^{\mathrm v}.$$

Since $\tilde E_a^{\mathrm c}(L) = 0$ by invariance, it follows that at the identity

$$\hat E_a^{\mathrm c}(L) = -\xi^c C^b_{ca}\, p_b.$$

The necessary and sufficient conditions for $\xi$ to define a relative equilibrium at $e$ may therefore
be written $\hat E_a^{\mathrm c}(L)(e, \xi) = 0$.

We note in passing that if the Lagrangian is bi-invariant, that is, invariant under both left and
right translations, so that $\hat E_a^{\mathrm c}(L) = 0$ everywhere (as well as $\tilde E_a^{\mathrm c}(L) = 0$), then all base integral
curves of $\Gamma$ through $e$ coincide with 1-parameter subgroups, and therefore all base integral curves
are translates of 1-parameter subgroups. These curves are just the geodesics of the canonical
torsionless connection on $G$, which is defined by

$$\nabla_{\hat E_a} \hat E_b = \tfrac12 C^c_{ab} \hat E_c.$$

The Euler-Poincaré equations reduce to

$$\frac{d}{dt}\left(\frac{\partial l}{\partial \xi^a}\right) = 0.$$

Conversely, if $L$ is left-invariant ($\tilde E_a^{\mathrm c}(L) = 0$) and all base integral curves of its Euler-Lagrange
field $\Gamma$ are translates of 1-parameter subgroups then $L$ must be bi-invariant. For it must certainly
be the case that $\hat E_a^{\mathrm c}(L)(e, \xi) = 0$ for all $\xi \in \mathfrak{g}$. But $\tilde E_b^{\mathrm c} \hat E_a^{\mathrm c}(L) = \hat E_a^{\mathrm c} \tilde E_b^{\mathrm c}(L) = 0$, so $\hat E_a^{\mathrm c}(L) = 0$
everywhere. It is a well-known property of invariant Riemannian metrics on a Lie group that
the exponential map determined by the Levi-Civita connection coincides with the exponential
in the group sense if and only if the metric is bi-invariant. The result above is a generalization
of this property to regular invariant Lagrangians.
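
The contrast between invariant and bi-invariant Lagrangians can be seen in a small numerical sketch. It assumes $G = SO(3)$, a basis of $\mathfrak{g}$ in which the adjoint action is ordinary rotation of $\xi$, and a diagonal quadratic reduced Lagrangian; these choices and all names below are hypothetical. The invariance check samples rotations about one axis only, so it can refute bi-invariance but not establish it.

```python
# Sketch: on so(3) (assumed basis, adjoint action = rotation of xi), compare
# an isotropic quadratic reduced Lagrangian with an anisotropic one.
import math


def rotate_z(angle, xi):
    """Adjoint action of a rotation about the third axis (assumed model)."""
    c, s = math.cos(angle), math.sin(angle)
    return [c * xi[0] - s * xi[1], s * xi[0] + c * xi[1], xi[2]]


def reduced_lagrangian(diag_inertia, xi):
    """l(xi) = (1/2) sum_a I_a (xi^a)^2 for a diagonal inertia matrix."""
    return 0.5 * sum(i * x * x for i, x in zip(diag_inertia, xi))


def cross(u, v):
    """Cross product, which is the Lie bracket of so(3) in the assumed basis."""
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]


def equilibrium_residual(diag_inertia, xi):
    """xi x (I xi): zero exactly when xi satisfies the equilibrium condition."""
    return cross(xi, [i * x for i, x in zip(diag_inertia, xi)])


def is_ad_invariant_along_z(diag_inertia, xi, angles=(0.3, 1.1, 2.5), tol=1e-12):
    """Check l(Ad_g xi) = l(xi) for a few sample rotations about the z-axis."""
    base = reduced_lagrangian(diag_inertia, xi)
    return all(
        abs(reduced_lagrangian(diag_inertia, rotate_z(a, xi)) - base) < tol
        for a in angles
    )
```

With equal moments of inertia the sampled invariance holds and the residual vanishes for an arbitrarily chosen $\xi$, in line with the statement that every 1-parameter subgroup is then a base integral curve; with distinct moments both properties fail for a generic $\xi$.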

The problem of the existence of relative equilibria for invariant systems on Lie groups has been
studied recently by several authors, using differing terminology: Hernández-Garduño et al. [5]
(for kinetic energy Lagrangians, i.e. geodesics of an invariant Riemannian metric on a Lie group);
Latifi [7] (for invariant Finsler structures, under the name 'homogeneous geodesics'); Szenthe
[16] (for a general invariant Lagrangian, under the name 'stationary geodesics'). Our results
above incorporate the particular cases in [5] and [7]. Furthermore, our results improve on those
of Szenthe [16] in that we do not require one of the hypotheses, namely that the Lagrangian is a
first integral of its Euler-Lagrange field, in both his Proposition 2.2, which (in different notation)
gives the condition for a relative equilibrium in the form $\hat E_a^{\mathrm c}(L)(e, \xi) = 0$, and his Theorem 2.3,
which gives the condition in terms of critical points of $l|_{\mathcal{G}(\xi)}$.



6.2 Simple mechanical systems 

A simple mechanical system is a Lagrangian system in which the Lagrangian takes the familiar 
form $L = T - V$, where $T$ is the kinetic energy associated with a Riemannian metric $g$ on $M$
and V is the potential energy, a function on M. Such a Lagrangian is necessarily regular since 
its Hessian is effectively just the Riemannian metric. 

In the case of a simple mechanical system we take as symmetry group $G$ the group of
diffeomorphisms of $M$ which are isometries of the metric and leave the potential invariant. We must
assume of course that $G$ acts freely and effectively on $M$. We define the invariant vector fields $X_i$
of a standard basis as follows. The orthogonal complements to the tangent spaces to the fibres
of the principal bundle $M \to B$ are the horizontal subspaces of a principal connection, called
the mechanical connection. The $X_i$ are the horizontal lifts to $M$, relative to the mechanical
connection, of the vector fields of some local basis on $B$. We write $g_{ab} = g(\tilde E_a, \tilde E_b)$, $g_{ij} = g(X_i, X_j)$;
by assumption, $g_{ai} = g(\tilde E_a, X_i) = 0$. Thus

$$L(m, v) = \tfrac12\left(g_{ij}(m)\, v^i v^j + g_{ab}(m)\, v^a v^b\right) - V(m),$$

where the $v$s are the quasi-velocities associated with the standard basis, as before. It is clear
that the $g_{ab}$ etc., which are here defined as components of the metric, are also the appropriate
components of the Hessian of $L$. Since we assume that $L$ is regular the matrix $(g_{ab}(m))$ is
necessarily non-singular in this case.

In the case of a simple mechanical system the components of momentum are given simply by
$p_a = g_{ab} v^b$. The restriction of the Routhian to a level set of momentum is

$$\mathcal{R}^\mu = \tfrac12 g_{ij} v^i v^j - \left(V + \tfrac12 g^{ab}\mu_a\mu_b\right) = \tfrac12 g_{ij} v^i v^j - V^\mu;$$

$V^\mu$ is the so-called amended potential [H]. The restriction of the energy to a level set is given
by

$$\mathcal{E}^\mu = \tfrac12 g_{ij} v^i v^j + \tfrac12 g^{ab}\mu_a\mu_b + V = \tfrac12 g_{ij} v^i v^j + V^\mu.$$

The relative equilibrium points on the level set are determined by the critical points of $\mathcal{E}^\mu$, and
these are points of the form $(m, 0, \xi^a)$ where $\xi^a = g^{ab}\mu_b$ and $m$ is a critical point of $V^\mu$. Now
one of the conditions for a relative equilibrium point is that $\xi^d C^c_{ad}\,\mu_c = C^c_{ad}\, g^{bd}(m)\,\mu_b\mu_c = 0$; this
is in fact included in the condition for $m$ to be a critical point of $V^\mu$. To see this, note that

$$\tilde E_a(g^{bc}) = -g^{bd} g^{ce}\, \tilde E_a(g_{de}) = g^{bd} g^{ce}\left(C^f_{ad}\, g_{ef} + C^f_{ae}\, g_{df}\right) = g^{bd} C^c_{ad} + g^{ce} C^b_{ae}.$$

It follows that

$$\tilde E_a\left(\tfrac12 g^{bc}\mu_b\mu_c\right) = g^{bd} C^c_{ad}\,\mu_b\mu_c$$

as required. So if $(m, 0, \xi^a)$ is a relative equilibrium point on the level set $p_a = \mu_a$, then
$\mu_a = g_{ab}(m)\xi^b$, and $m$ must be a critical point of the amended potential $V^\mu$. Conversely, if $m$ is
a critical point of $V^\mu$ then $(m, 0, \xi^a)$ is a relative equilibrium point, where $\xi^a = g^{ab}(m)\mu_b$.

On the other hand, the locked Lagrangian is given by

$$L_\xi = \tfrac12 g_{ab}\xi^a\xi^b - V;$$

the quantity $V - \tfrac12 g_{ab}\xi^a\xi^b$ is the augmented or effective potential [14], $V_\xi$. Then $(m, 0, \xi^a)$ is a
relative equilibrium point if and only if $m$ is a critical point of $V_\xi$. Notice that for any $w \in T_mM$,
$w(g^{ab}) = -g^{ac} g^{bd}\, w(g_{cd})$, so that if $\xi^a = g^{ab}\mu_b$

$$w(V^\mu) = w(V) - \tfrac12 g^{ac} g^{bd}\, w(g_{cd})\,\mu_a\mu_b = w(V) - \tfrac12 w(g_{cd})\,\xi^c\xi^d = w(V_\xi),$$

so the two criteria for the existence of a relative equilibrium point are consistent.
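
The consistency of the two criteria can be checked on a simple case. The sketch below assumes a unit-mass particle in the punctured plane with rotational symmetry, so that the locked inertia is the $1 \times 1$ matrix $g(r) = r^2$, and a hypothetical potential $V(r) = -k/r$; none of these choices, nor the function names, come from the text. It compares numerical derivatives of the amended and augmented potentials at a radius $r_0$ with $\mu = g(r_0)\xi$.

```python
# Sketch: unit-mass particle in the punctured plane with rotational symmetry
# (an assumed example). The locked inertia is the 1x1 matrix g(r) = r**2.

def potential(r, k=1.0):
    """Hypothetical invariant potential V(r) = -k / r."""
    return -k / r


def amended_potential(r, mu):
    """V^mu = V + (1/2) g^{-1} mu^2, with the momentum mu held fixed."""
    return potential(r) + 0.5 * mu * mu / (r * r)


def augmented_potential(r, xi):
    """V_xi = V - (1/2) g xi^2, with the velocity xi held fixed."""
    return potential(r) - 0.5 * r * r * xi * xi


def derivative(f, r, h=1e-6):
    """Central-difference approximation to f'(r)."""
    return (f(r + h) - f(r - h)) / (2.0 * h)


def slopes_at(r0, xi):
    """Return (dV^mu/dr, dV_xi/dr) at r0 when mu = g(r0) * xi."""
    mu = r0 * r0 * xi
    d_amended = derivative(lambda r: amended_potential(r, mu), r0)
    d_augmented = derivative(lambda r: augmented_potential(r, xi), r0)
    return d_amended, d_augmented
```

The two slopes agree at any $r_0$ once $\mu$ and $\xi$ are related in this way, and both vanish when $\xi^2 = k/r_0^3$, which is the circular orbit of the assumed potential.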

Since $\mu_a = g_{ab}(m)\xi^b$, the condition $\xi^b C^c_{ab}\,\mu_c = 0$ can be written in the form $C^c_{ad}\, g^{bd}(m)\,\mu_b\mu_c = 0$,
as we have already observed, and also in the form $C^c_{ab}\,\xi^b g_{cd}(m)\,\xi^d = 0$. Now $g_{ab}$ may be regarded
as defining a function on $M$ taking its values in the space of symmetric bilinear forms on $\mathfrak{g}$, in
the sense that for any $m \in M$, $(g_{ab}(m))$ is the matrix of such a bilinear form with respect to
the basis $\{E_a\}$ of $\mathfrak{g}$. With this interpretation we can express the condition $C^c_{ab}\,\xi^b g_{cd}(m)\,\xi^d = 0$
equivalently as $g(m)(\xi, [\xi, \eta]) = 0$ for all $\eta \in \mathfrak{g}$. This generalizes a result of Szenthe's [15] for the
case of an invariant Riemannian metric on a Lie group, when this condition with $m = e$ is the
only condition for $\xi$ to determine a relative equilibrium through the identity.

6.3 Saari's conjecture



We continue to discuss the case of a simple mechanical system. 
The matrix-valued function $(g_{ab})$ is called the locked inertia tensor.

It has been conjectured (see [5, 8]), on the basis of certain results for the $N$-body problem, that
'a Lagrangian simple mechanical system with symmetry is at a point of relative equilibrium if
and only if the locked inertia tensor is constant along the integral curve that passes through
that point'. The original version of this conjecture, in the context of the $N$-body problem, was
formulated by Saari; the version above is called the naive generalization of Saari's conjecture.

It is evident from the formula

$$\tilde\xi(g_{bc}) = \xi^a \tilde E_a(g_{bc}) = -\left(\xi^a C^d_{ab}\, g_{cd} + \xi^a C^d_{ac}\, g_{bd}\right),$$

which is part of Killing's equation for $\tilde\xi$, that if at a relative equilibrium point $(m, 0, \xi^a)$ we have
$\xi^a C^c_{ab} = 0$ (and not just $\xi^a C^c_{ab}\,\mu_c = 0 = C^d_{ab}\, g_{cd}\,\xi^a\xi^c$) then the locked inertia tensor is constant
along the corresponding integral curve. That is to say, if $(m, 0, \xi^a)$ is a relative equilibrium
point for which $\xi$ belongs to the centre of $\mathfrak{g}$ then the locked inertia tensor is constant along the
integral curve. On the other hand, our analysis suggests that it is unlikely that in general the
locked inertia tensor is necessarily constant along the integral curve of a relative equilibrium.
So it seems unlikely that Saari's conjecture holds in all generality; and indeed it is known to be
false. In a refined version of Saari's conjecture formulated in [5, 8] it is required only that $g_{ab}\xi^b$ is
constant along the integral curve. It is clear that when $\xi$ does define a relative equilibrium point,
$g_{ab}\xi^b$ is constant along the integral curve, because $g_{ab}\xi^b = \mu_a$ is the value of the momentum at
the relative equilibrium point and the integral curve through the point lies in the same level set.
In fact, even in the case of a general Lagrangian we have

$$\tilde\xi^{\mathrm c}(g_{bc}\,\xi^c) = -\left(\xi^a C^d_{ab}\, g_{cd} + \xi^a C^d_{ac}\, g_{bd}\right)\xi^c = -C^d_{ab}\, g_{cd}\,\xi^a\xi^c$$

(using a formula which generalises the one at the beginning of this paragraph), from which it is
clear that $g_{bc}\xi^c$ is constant along an integral curve of $\tilde\xi^{\mathrm c}$ if and only if the condition $C^d_{ab}\, g_{cd}\,\xi^a\xi^c = 0$
holds. This is indeed a requirement for a point to be a relative equilibrium point, in the case of
a simple mechanical system; however, in general there is a further requirement involving critical
points of the augmented potential. But in the case of an invariant simple Lagrangian on a Lie
group $C^d_{ab}\, g_{cd}\,\xi^a\xi^c = 0$ is the only condition for $\xi$ to be a relative equilibrium point, so invariant
simple Lagrangians on Lie groups belong to the class of Lagrangian systems with symmetry for
which the refined Saari conjecture holds, as is pointed out in [5].
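
The difference between the naive and the refined versions can be illustrated numerically. The sketch below evaluates the right-hand side of the formula for the rate of change of $g_{bc}$ along $\tilde\xi$, and its contraction with $\xi^c$, assuming $\mathfrak{g} = \mathfrak{so}(3)$ with $C^d_{ab}$ given by the Levi-Civita symbol and a diagonal matrix $(g_{ab})$ at the point considered. The group, the numerical values and the function names are assumptions made for the example.

```python
# Sketch: evaluate D_bc = -(xi^a C^d_{ab} g_cd + xi^a C^d_{ac} g_bd) on so(3),
# assuming C^d_{ab} = eps_{abd} and a given symmetric matrix g at one point.

def eps(a, b, c):
    """Levi-Civita symbol for indices 0..2."""
    return (a - b) * (b - c) * (c - a) / 2


def inertia_rate(g, xi):
    """Matrix of rates of change of g_bc along the orbit generated by xi."""
    n = range(3)
    return [
        [-sum(xi[a] * (eps(a, b, d) * g[c][d] + eps(a, c, d) * g[b][d])
              for a in n for d in n)
         for c in n]
        for b in n
    ]


def momentum_rate(g, xi):
    """Rate of change of g_bc xi^c: the contraction of inertia_rate with xi."""
    rate = inertia_rate(g, xi)
    return [sum(rate[b][c] * xi[c] for c in range(3)) for b in range(3)]


# Hypothetical values: distinct principal moments, xi along the third axis.
G_MATRIX = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 3.0]]
XI_AXIS = [0.0, 0.0, 1.0]
```

For these values `inertia_rate` returns a non-zero matrix while `momentum_rate` returns zero: $\xi$ satisfies $C^d_{ab}\, g_{cd}\,\xi^a\xi^c = 0$ without lying in the centre of $\mathfrak{g}$, so $g_{ab}\xi^b$ is constant although the locked inertia tensor is not.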